CRS_projections_transformations.Rmd
Changes in the representation of coordinate reference systems (CRS), and of operations on coordinates, that have been occurring over decades must now be implemented in the way spatial objects are handled in R packages. Up to the 1990s, most spatial data simply used the coordinates given by the local mapping authority; for example, the Meuse bank data set used a planar representation in metres, which turned out to be EPSG:28992
, “Amersfoort / RD New.” A major resource for finding out why CRS were specified as they were are Clifford J. Mugnier’s columns in Photogrammetric Engineering & Remote Sensing, references to which are available in rgdal; the Netherlands were covered in the February 2003 column:
td <- tempfile()
dir.create(td)
Sys.setenv("PROJ_USER_WRITABLE_DIRECTORY"=td)
library(rgdal)
## Loading required package: sp
## Please note that rgdal will be retired during 2023,
## plan transition to sf/stars/terra functions using GDAL and PROJ
## at your earliest convenience.
## See https://r-spatial.org/r/2022/04/12/evolution.html and https://github.com/r-spatial/evolution
## rgdal: version: 1.6-4, (SVN revision 1196)
## Geospatial Data Abstraction Library extensions to R successfully loaded
## Loaded GDAL runtime: GDAL 3.6.2, released 2023/01/02
## Path to GDAL shared files: /usr/local/share/gdal
## GDAL does not use iconv for recoding strings.
## GDAL binary built with GEOS: TRUE
## Loaded PROJ runtime: Rel. 9.1.1, December 1st, 2022, [PJ_VERSION: 911]
## Path to PROJ shared files: /tmp/RtmpxkDNCG/file10a14268afdae:/usr/local/share/proj:/usr/local/share/proj
## PROJ CDN enabled: FALSE
## Linking to sp version:1.5-1
## To mute warnings of possible GDAL/OSR exportToProj4() degradation,
## use options("rgdal_show_exportToProj4_warnings"="none") before loading sp or rgdal.
## country month year ISO
## 50 Aruba and the Netherlands Antilles (July) 2002 ABW
## 57 The Kingdom of the Netherlands (February) 2003 NLD
## 243 The Kingdom of the Netherlands (February) 2021 NLD
While most national mapping agencies defined their own standard geographical and projected CRS, supranational bodies, such as military alliances and colonial administrations often imposed some regularity to facilitate operations across national boundaries. This also led to the creation of the European Petroleum Survey Group (EPSG), because maritime jurisdiction was not orderly, and mattered when countries sharing the same coastal shelf tried to assign conflicting exploration concessions. Experts from oil companies accumulated vast experience, which fed through to the International Standards Organization (ISO, especially TC 211) and the Open Geospatial Consortium (OGC).
Defining the CRS became necessary when integrating other data with a different CRS, and for displaying on a web map background. Many legacy file formats, such as the ESRI Shapefile format, did not mandate the inclusion of the CRS of positional data. Most open source software then used PROJ.4 strings as a flexible representation, but as internationally accepted standards have been reached, in particular ISO 19111, and improved over time by iteration, it is really necessary to change to a modern text representation, known as WKT2 (2019). Now it looks as though almost all corporations and mapping agencies accommodate this representation, and it has been adopted by sp through rgdal, sf and other packages.
demo(meuse, ask=FALSE, package="sp", echo=FALSE)
library(mapview)
x <- mapview(meuse, zcol="zinc")
mapshot(x, file="meuse.png")
knitr::include_graphics(system.file("misc/meuse.png", package="rgdal"))
The mapproj package provided coordinate reference system and projection support for the maps package. From mapproj/src/map.h
, line 20, we can see that the eccentricity of the Earth is defined as 0.08227185422
, corrresponding to the Clark 1866 ellipsoid (Iliffe and Lott 2008):
ellps <- projInfo("ellps")
(clrk66 <- ellps[ellps$name=="clrk66",])
## name major ell description
## 16 clrk66 a=6378206.4 b=6356583.8 Clarke 1866
#eval(parse(text=clrk66$major))
#eval(parse(text=clrk66$ell))
a <- 6378206.4
b <- 6356583.8
print(sqrt((a^2-b^2)/a^2), digits=10)
## [1] 0.08227185422
With a very few exceptions, projections included in mapproj::mapproject()
use the Clarke 1866 ellipsoid, with the remainder using a sphere with the Clarke 1866 major axis radius. The function returns coordinates for visualization in an unknown metric; no inverse projections are available.
Like many other free and open source software projects around 2000, R spatial development chose to use the best open source library and infrastructure then available, PROJ.4. Version 4.4 was published by Frank Warmerdam in 2000, based on Gerald Evenden’s earlier work. This earlier work was a library for forward and inverse projection using a key-value string interface to describe the required projection (Evenden 1990). The key-value string is taken as +key=value
, where =value
could be omitted for some keys, and the definition of each projection is built up from space-separated key-value string, such as +proj=utm +zone=25 +south
for Universal Transverse Mercator zone 25 in the southern hemisphere.
Unlike mapproj, PROJ.4 had begun to introduce the +ellps=
key in addition to projection parameters before PROJ.4.4, as some users would not treat Clark 1866 as their natural preference. The need for more care in geodetic specification became pressing from 2000, when civilian use of GPS became as accurate as military use; GPS was typically registered to a more modern geographical CRS than digitized printed maps had been, and most mapping agencies scrambled to update their products and services to “GPS coordinates.”
The ellps=
key was followed by the +datum=
, +nadgrids=
and +towgs84=
keys in successive releases to attempt to specify the geodetic model. The +init=
key appeared to permit the look-up of sets of key-value strings in the packaged version of a given table with a known authority, typically +init=epsg:<code>
, where <code>
was the EPSG code of the coordinate reference system. Where +towgs84=
was given, a three or seven parameter transformation to the WGS84 datum was provided as a comma-separated string, so that the coordinate reference system also included the inverse coordinate operation from projected coordinates to geographical coordinates defined by the WGS84 datum. This led to the need for a placeholder for geographical coordinates, set as +proj=longlat
(or lonlat
, or perhaps with reversed axis order latlong
or latlon
). Some of the issues involved were discussed in a 2010 blog post by Frank Warmerdam.
The PROJ.4 framework functioned well for projection before it was expected to handle datum tranformation too. Within the remit of single mapping agencies, some adaptation could still provide help, say using +nadgrids=
in parts of North America (NAD, North American Datum), but where positional data from multiple agencies was being integrated, the framework was showing its age. For example, from about 2010 it was observed that the +datum=
and +towgs84=
keys sometimes provided contradictory values, leading functions in GDAL reading raster files to prefer +towgs84=
values to +datum=
values. For users for whom accuracy of better than about 150m was irrelevant, using coordinate reference systems correctly defined in terms of the underlying geographical coordinates was less important, but this is still about five Landsat cells.
While the representation of coordinate reference systems (sometimes supplemented by coordinate operations to transform the underlying geographical coordinates to WGS84) as PROJ.4 strings continued to work adequately, changes were occurring. Many file formats chose to use WKT (well-known text) string representations, starting from the 2007 edition of the ISO standard (ISO 2019). This placed the file reading and writing functions offered by GDAL under stress, especially the exportToProj4()
function, as an increasing number of specification components really did not map adequately from the WKT string representation to the PROJ.4 string representation. Another change was that pivoting through a chosen hub when transforming coordinates (in PROJ.4 WGS84) meant that accuracy was lost transforming from the source to the hub, and more was lost from the hub to the target. Why not transform from source to target in one step if possible?
Signalling changes, PROJ.4 changed its name to PR\(\phi\)J, and began burning through major version numbers. PROJ 5 (2018) introduced transformation pipelines (Knudsen and Evers 2017; Evers and Knudsen 2017), representing coordinate operations using a syntax similar to PROJ.4 strings, but showing the whole operation pipeline. PROJ 6 (2019) followed up by shifting from ad-hoc text files for holding coordinate reference system and coordinate operation metadata to an SQLite database. In an increasing number of cases, more accurate coordinate operations could be supported using open access transformation grids, and the grid files needed were now tabulated in the SQLite database. This database is distributed with PROJ, and kept in a directory on the PROJ search path, usually the only or final directory (get_proj_search_paths()
returns a vector of current search paths).
(shpr <- get_proj_search_paths())
## [1] "/tmp/RtmpxkDNCG/file10a14268afdae" "/usr/local/share/proj"
## [3] "/usr/local/share/proj"
library(RSQLite)
db <- dbConnect(SQLite(), dbname=file.path(shpr[length(shpr)], "proj.db"))
dbListTables(db)
## [1] "alias_name"
## [2] "authority_list"
## [3] "authority_to_authority_preference"
## [4] "axis"
## [5] "celestial_body"
## [6] "compound_crs"
## [7] "concatenated_operation"
## [8] "concatenated_operation_step"
## [9] "conversion"
## [10] "conversion_method"
## [11] "conversion_param"
## [12] "conversion_table"
## [13] "coordinate_operation_method"
## [14] "coordinate_operation_view"
## [15] "coordinate_operation_with_conversion_view"
## [16] "coordinate_system"
## [17] "crs_view"
## [18] "deprecation"
## [19] "ellipsoid"
## [20] "extent"
## [21] "geodetic_crs"
## [22] "geodetic_datum"
## [23] "geodetic_datum_ensemble_member"
## [24] "geoid_model"
## [25] "grid_alternatives"
## [26] "grid_packages"
## [27] "grid_transformation"
## [28] "helmert_transformation"
## [29] "helmert_transformation_table"
## [30] "metadata"
## [31] "object_view"
## [32] "other_transformation"
## [33] "prime_meridian"
## [34] "projected_crs"
## [35] "scope"
## [36] "sqlite_stat1"
## [37] "supersession"
## [38] "unit_of_measure"
## [39] "usage"
## [40] "versioned_auth_name_mapping"
## [41] "vertical_crs"
## [42] "vertical_datum"
## [43] "vertical_datum_ensemble_member"
(metadata <- dbReadTable(db, "metadata"))
## key
## 1 DATABASE.LAYOUT.VERSION.MAJOR
## 2 DATABASE.LAYOUT.VERSION.MINOR
## 3 EPSG.DATE
## 4 EPSG.VERSION
## 5 ESRI.DATE
## 6 ESRI.VERSION
## 7 IGNF.DATE
## 8 IGNF.SOURCE
## 9 IGNF.VERSION
## 10 NKG.DATE
## 11 NKG.SOURCE
## 12 NKG.VERSION
## 13 PROJ.VERSION
## 14 PROJ_DATA.VERSION
## value
## 1 1
## 2 2
## 3 2022-08-31
## 4 v10.076
## 5 2022-07-09
## 6 ArcGIS Pro 3.0
## 7 2019-05-24
## 8 https://raw.githubusercontent.com/rouault/proj-resources/master/IGNF.v3.1.0.xml
## 9 3.1.0
## 10 2020-12-21
## 11 https://github.com/NordicGeodesy/NordicTransformations
## 12 1.0.0
## 13 9.1.1
## 14 1.12
PROJ 7 (2020) reconfigured the transformation grids, now using the Geodetic TIFF Grid (GTG) format, and created pathways for on-demand download (typically using a content download network (CDN) over CURL) of chunks of such grids to a local user-writable cache held in another SQLite database. After little change from the late 1990s to early 2018, PROJ.4 has incremented its major version number three times in three years, and by 2021 (PROJ 8), the pre-existing application programming interface will be history. In addition, GDAL 3 (2019) has tightened its links with PROJ >= 6, and exportToProj4()
now says:
“Use of this function is discouraged. Its behavior in GDAL >= 3 / PROJ >= 6 is significantly different from earlier versions. In particular +datum will only encode WGS84, NAD27 and NAD83, and +towgs84/+nadgrids terms will be missing most of the time. PROJ strings to encode CRS should be considered as a a legacy solution. Using a AUTHORITY:CODE or WKT representation is the recommended way” (https://gdal.org/api/ogrspatialref.html).
For these reasons, sf and sp are changing from PROJ.4 strings representing coordinate reference systems to WKT2:2019 strings, as described in this r-spatial blog. Most users who had been relying on file reading to set the coordinate reference systems of objects will not notice much difference, and legacy PROJ.4 strings can still be used to create new, authority-free definitions if need be.
It will be useful to know that in general "OGC:CRS84"
should be used instead of "EPSG:4326"
, because the latter presupposes that Latitude is always ordered before Longitude. "OGC:CRS84"
is the standard representation used by GeoJSON, with coordinates always ordered Longitude before Latitude. This is also the standard GIS and visualization order:
## GEOGCRS["WGS 84 (CRS84)",
## ENSEMBLE["World Geodetic System 1984 ensemble",
## MEMBER["World Geodetic System 1984 (Transit)"],
## MEMBER["World Geodetic System 1984 (G730)"],
## MEMBER["World Geodetic System 1984 (G873)"],
## MEMBER["World Geodetic System 1984 (G1150)"],
## MEMBER["World Geodetic System 1984 (G1674)"],
## MEMBER["World Geodetic System 1984 (G1762)"],
## MEMBER["World Geodetic System 1984 (G2139)"],
## ELLIPSOID["WGS 84",6378137,298.257223563,
## LENGTHUNIT["metre",1]],
## ENSEMBLEACCURACY[2.0]],
## PRIMEM["Greenwich",0,
## ANGLEUNIT["degree",0.0174532925199433]],
## CS[ellipsoidal,2],
## AXIS["geodetic longitude (Lon)",east,
## ORDER[1],
## ANGLEUNIT["degree",0.0174532925199433]],
## AXIS["geodetic latitude (Lat)",north,
## ORDER[2],
## ANGLEUNIT["degree",0.0174532925199433]],
## USAGE[
## SCOPE["Not known."],
## AREA["World."],
## BBOX[-90,-180,90,180]],
## ID["OGC","CRS84"]]
Note that all "GEOGCRS"
"BBOX"
CRS bounding boxes are always specified latitude minimum, longitude minimum, latitude maximum, longitude maximum, even though the declared axis order is the opposite.
The introduction of interactive mapping using mapview and tmap among other packages highlights the need to set the coordinate reference system (CRS) of objects correctly, so that zooming does not reveal embarrassing divergences. Using the location of the Broad Street pump disabled by Dr John Snow to stop a cholera epidemic in Soho, London, in 1854 (Brody et al. 2000), we can start to see what steps are being taken. The point location of the pump is given in projected coordinates, defined in the British National Grid. The workflow used by mapview::mapview()
is to transform first to WGS84 (OGC:CRS84) using sf::st_transform()
if need be, before permitting leaflet to project to Web Mercator (EPSG:3857) internally.
b_pump <- readOGR(system.file("vectors/b_pump.gpkg", package="rgdal"))
## Warning: OGR support is provided by the sf and terra packages among others
## Warning: OGR support is provided by the sf and terra packages among others
## Warning: OGR support is provided by the sf and terra packages among others
## Warning: OGR support is provided by the sf and terra packages among others
## Warning: OGR support is provided by the sf and terra packages among others
## Warning: OGR support is provided by the sf and terra packages among others
## Warning: OGR support is provided by the sf and terra packages among others
## OGR data source with driver: GPKG
## Source: "/tmp/RtmpnkEu0w/temp_libpath101e75200293a/rgdal/vectors/b_pump.gpkg", layer: "b_pump"
## with 1 features
## It has 1 fields
## Integer64 fields read as strings: cat
Reading the file loses the PROJ.4 +datum=
key-value pair, but the WKT2:2019 string is complete:
proj4string(b_pump)
## [1] "+proj=tmerc +lat_0=49 +lon_0=-2 +k=0.999601 +x_0=400000 +y_0=-100000 +ellps=airy +units=m +no_defs"
if (packageVersion("sp") > "1.4.1") {
WKT <- wkt(b_pump)
} else {
WKT <- comment(slot(b_pump, "proj4string"))
}
cat(WKT, "\n")
## PROJCRS["Transverse_Mercator",
## BASEGEOGCRS["GCS_OSGB 1936",
## DATUM["OSGB_1936",
## ELLIPSOID["Airy 1830",6377563.396,299.3249646,
## LENGTHUNIT["metre",1]]],
## PRIMEM["Greenwich",0,
## ANGLEUNIT["Degree",0.0174532925199433]],
## ID["EPSG",4277]],
## CONVERSION["unnamed",
## METHOD["Transverse Mercator",
## ID["EPSG",9807]],
## PARAMETER["Latitude of natural origin",49,
## ANGLEUNIT["Degree",0.0174532925199433],
## ID["EPSG",8801]],
## PARAMETER["Longitude of natural origin",-2,
## ANGLEUNIT["Degree",0.0174532925199433],
## ID["EPSG",8802]],
## PARAMETER["Scale factor at natural origin",0.999601,
## SCALEUNIT["unity",1],
## ID["EPSG",8805]],
## PARAMETER["False easting",400000,
## LENGTHUNIT["metre",1],
## ID["EPSG",8806]],
## PARAMETER["False northing",-100000,
## LENGTHUNIT["metre",1],
## ID["EPSG",8807]]],
## CS[Cartesian,2],
## AXIS["easting",east,
## ORDER[1],
## LENGTHUNIT["metre",1,
## ID["EPSG",9001]]],
## AXIS["northing",north,
## ORDER[2],
## LENGTHUNIT["metre",1,
## ID["EPSG",9001]]]]
PROJ.4 assumed that the from/source and to/target coordinate reference system definitions involved in coordinate operations each contained the specifications necessary to get from source to WGS84 and then on from WGS84 to target. PR\(\phi\)J drops this assumption, searching among many candidate coordinate operations for viable pipelines. The search is conducted using the tables given in the proj.db
SQLite database, which is now backed by authorities, and regularly updated at each release to the current upstream state (see https://proj.org/operations/operations_computation.html for more details). The tables are searched to find lists of candidates.
cov <- dbReadTable(db, "coordinate_operation_view")
## Warning in result_fetch(res@ptr, n = n): Column `code`: mixed type, first seen
## values of type integer, coercing other values of type string
## Warning in result_fetch(res@ptr, n = n): Column `method_code`: mixed type, first
## seen values of type integer, coercing other values of type string
## Warning in result_fetch(res@ptr, n = n): Column `source_crs_code`: mixed type,
## first seen values of type integer, coercing other values of type string
## Warning in result_fetch(res@ptr, n = n): Column `target_crs_code`: mixed type,
## first seen values of type integer, coercing other values of type string
## table_name code name
## 162 grid_transformation 5338 OSGB36 to ETRS89 (1)
## 163 grid_transformation 5339 OSGB36 to WGS 84 (7)
## 213 grid_transformation 7709 OSGB36 to ETRS89 (2)
## 214 grid_transformation 7710 OSGB36 to WGS 84 (9)
## 755 grid_transformation 108057 ETRS_1989_To_OSGB_1936_OSTN15
## 756 grid_transformation 108058 WGS_1984_To_OSGB_1936_OSTN15
## 757 grid_transformation 108059 OSGB_1936_To_Xrail84_NTv2
## 970 helmert_transformation 1195 OSGB36 to WGS 84 (1)
## 971 helmert_transformation 1196 OSGB36 to WGS 84 (2)
## 972 helmert_transformation 1197 OSGB36 to WGS 84 (3)
## 973 helmert_transformation 1198 OSGB36 to WGS 84 (4)
## 974 helmert_transformation 1199 OSGB36 to WGS 84 (5)
## 1072 helmert_transformation 1314 OSGB36 to WGS 84 (6)
## 1073 helmert_transformation 1315 OSGB36 to ED50 (1)
## 1627 helmert_transformation 5622 OSGB36 to WGS 84 (8)
## 2427 helmert_transformation 108089 OSGB_1936_To_WGS_1984_8_BAD_DX
## 2582 helmert_transformation 108336 OSGB_1936_To_WGS_1984_NGA_7PAR
## method_name accuracy
## 162 NTv2 0.03
## 163 NTv2 1.00
## 213 NTv2 0.03
## 214 NTv2 1.00
## 755 NTv2 0.03
## 756 NTv2 1.00
## 757 NTv2 0.50
## 970 Geocentric translations (geog2D domain) 21.00
## 971 Geocentric translations (geog2D domain) 10.00
## 972 Geocentric translations (geog2D domain) 21.00
## 973 Geocentric translations (geog2D domain) 18.00
## 974 Geocentric translations (geog2D domain) 35.00
## 1072 Position Vector transformation (geog2D domain) 2.00
## 1073 Position Vector transformation (geog2D domain) 2.00
## 1627 Geocentric translations (geog2D domain) 3.00
## 2427 Geocentric translations (geog2D domain) 5.00
## 2582 Coordinate Frame rotation (geog2D domain) 21.00
The same search can be conducted directly without using RSQLite to query the database tables, searching by source and target CRS, and in the near future also by area of interest. If we search using only the degraded PROJ.4 string, we only find a ballpark accuracy coordinate operation, yielding a pipeline with two steps, inverse projection to geographical coordinates in radians, and conversion from radians to degrees. Note that rgdal::spTransform()
and its wrapper sp::spTransform()
use PROJ for coordinate operations. Here we will use "EPSG:4326"
briefly for exposition:
list_coordOps(paste0(proj4string(b_pump), " +type=crs"), "EPSG:4326")
## Candidate coordinate operations found: 1
## Strict containment: FALSE
## Visualization order: TRUE
## Source: +proj=tmerc +lat_0=49 +lon_0=-2 +k=0.999601 +x_0=400000
## +y_0=-100000 +ellps=airy +units=m +no_defs +type=crs
## Target: EPSG:4326
## Best instantiable operation has only ballpark accuracy
## Description: Inverse of unknown + Ballpark geographic offset from unknown to
## WGS 84 + axis order change (2D)
## Definition: +proj=pipeline +step +inv +proj=tmerc +lat_0=49 +lon_0=-2
## +k=0.999601 +x_0=400000 +y_0=-100000 +ellps=airy
## +step +proj=unitconvert +xy_in=rad +xy_out=deg
The description component “+ axis order change (2D)” refers to the EPSG:4326
definition, which specifies Northings/Latitude as the first axis, and Eastings/Longitude as the second axis; in sp/rgdal workflows, it is assumed that GIS/visualization order with Eastings/Longitude as the first 2D axis and Northings/Latitude as the second axis is preferred. Because the input data are in GIS/visualization order already, the steps to swap axes to standards conformity and then back to GIS/visualization order cancel each other out.
We can see what is happening if we unset enforcing GIS/visualization order, with an axisswap
coming into play:
set_enforce_xy(FALSE)
list_coordOps(paste0(proj4string(b_pump), " +type=crs"), "EPSG:4326")
## Candidate coordinate operations found: 1
## Strict containment: FALSE
## Visualization order: FALSE
## Source: +proj=tmerc +lat_0=49 +lon_0=-2 +k=0.999601 +x_0=400000
## +y_0=-100000 +ellps=airy +units=m +no_defs +type=crs
## Target: EPSG:4326
## Best instantiable operation has only ballpark accuracy
## Description: Inverse of unknown + Ballpark geographic offset from unknown to
## WGS 84
## Definition: +proj=pipeline +step +inv +proj=tmerc +lat_0=49 +lon_0=-2
## +k=0.999601 +x_0=400000 +y_0=-100000 +ellps=airy
## +step +proj=unitconvert +xy_in=rad +xy_out=deg
## +step +proj=axisswap +order=2,1
set_enforce_xy(TRUE)
While rgdal tries to impose GIS/visualization axis order on the "EPSG:4326"
specification, one may end up with a WKT2 string with Latitude first, followed by Longitude without wishing to do so. Using "OGC:CRS84"
is more secure, because it does not require such ad-hoc re-specification.
Setting the internal control option set_transform_wkt_comment()
to FALSE
, we use only the degraded PROJ.4 string when transforming. spTransform()
undertakes the same search, chooses the best instantiable coordinate operation on its first pass, then uses that pipeline on all objects. The pipeline specification of the coordinate operation may be retrieved using get_last_coordOp()
:
set_transform_wkt_comment(FALSE)
isballpark <- spTransform(b_pump, CRS(SRS_string="OGC:CRS84"))
## Warning: PROJ support is provided by the sf and terra packages among others
## [1] "+proj=pipeline +step +inv +proj=tmerc +lat_0=49 +lon_0=-2 +k=0.999601 +x_0=400000 +y_0=-100000 +ellps=airy +step +proj=unitconvert +xy_in=rad +xy_out=deg"
The coordinate returned is unfortunately in Ingestre Place, not Broad Street.
print(coordinates(isballpark), digits=10)
## coords.x1 coords.x2
## [1,] -0.1351070464 51.5127926
Let us repeat the search using the WKT2 string; here we see that providing a well-specified CRS representation allows us to choose 2m accuracy for the coordinate operation. Further, we can also see that, had we had access to a named grid, we could have achieved 1m accuracy:
The Helmert transformation has parameters retrieved from the PROJ SQLite database (code 1314):
helm <- dbReadTable(db, "helmert_transformation_table")
## Warning in result_fetch(res@ptr, n = n): Column `code`: mixed type, first seen
## values of type integer, coercing other values of type string
## Warning in result_fetch(res@ptr, n = n): Column `source_crs_code`: mixed type,
## first seen values of type integer, coercing other values of type string
## Warning in result_fetch(res@ptr, n = n): Column `target_crs_code`: mixed type,
## first seen values of type integer, coercing other values of type string
helm[helm$code == "1314", c("auth_name", "code", "name", "tx", "ty", "tz", "rx", "ry", "rz", "scale_difference")]
## auth_name code name tx ty tz rx ry
## 239 EPSG 1314 OSGB36 to WGS 84 (6) 446.448 -125.157 542.06 0.15 0.247
## rz scale_difference
## 239 0.842 -20.489
dbDisconnect(db)
Using the WKT2 CRS representation, we can achieve 2m accuracy (or as the table says in other fields: “Oil exploration. Accuracy better than 4m and generally better than 2m”:
set_transform_wkt_comment(TRUE)
is2m <- spTransform(b_pump, CRS(SRS_string="OGC:CRS84"))
## Warning: PROJ support is provided by the sf and terra packages among others
## [1] "+proj=pipeline +step +inv +proj=tmerc +lat_0=49 +lon_0=-2 +k=0.999601 +x_0=400000 +y_0=-100000 +ellps=airy +step +proj=push +v_3 +step +proj=cart +ellps=airy +step +proj=helmert +x=446.448 +y=-125.157 +z=542.06 +rx=0.15 +ry=0.247 +rz=0.842 +s=-20.489 +convention=position_vector +step +inv +proj=cart +ellps=WGS84 +step +proj=pop +v_3 +step +proj=unitconvert +xy_in=rad +xy_out=deg"
The output point is close to the Broad Street pump:
print(coordinates(is2m), digits=10)
## coords.x1 coords.x2
## [1,] -0.1367127374 51.51330218
It is over 100m West-North-West of the Ingestre Place position:
## [1] 125.0577
c(maptools::gzAzimuth(coordinates(isballpark), coordinates(is2m)))
## coords.x1
## -62.98022
This was about as good as one could get prior to PROJ 7 without downloading the missing grid file manually, and installing the downloaded file in a directory that would usually not be user-writable. The whole set of grids can be downloaded and installed manually for workgroups needing to be sure that the same grids are available to all users, as has been the case in the past as well.
The rgdal::project()
uses the underlying geographical coordinate reference system, and does not transform, so using the degraded PROJ.4 string and WKT2 give the same output, and because the input is projected, we take the inverse:
(a <- project(coordinates(b_pump), proj4string(b_pump), inv=TRUE, verbose=TRUE))
## Warning: PROJ support is provided by the sf and terra packages among others
## Warning: PROJ support is provided by the sf and terra packages among others
## +proj=pipeline +step +inv +proj=tmerc +lat_0=49 +lon_0=-2 +k=0.999601
## +x_0=400000 +y_0=-100000 +ellps=airy +step +proj=unitconvert +xy_in=rad
## +xy_out=deg
## coords.x1 coords.x2
## [1,] -0.135107 51.51279
(b <- project(coordinates(b_pump), WKT, inv=TRUE))
## Warning: PROJ support is provided by the sf and terra packages among others
## Warning: PROJ support is provided by the sf and terra packages among others
## coords.x1 coords.x2
## [1,] -0.135107 51.51279
The projected points only inverse project the projected coordinates using the specified projection
all.equal(a, b)
## [1] TRUE
c(spDists(coordinates(isballpark), a)*1000)
## [1] 0
list_coordOps(WKT, "OGC:CRS84")
## Candidate coordinate operations found: 9
## Strict containment: FALSE
## Visualization order: TRUE
## Source: PROJCRS["Transverse_Mercator", BASEGEOGCRS["GCS_OSGB 1936",
## DATUM["OSGB_1936", ELLIPSOID["Airy 1830", 6377563.396,
## 299.3249646, LENGTHUNIT["metre", 1]]],
## PRIMEM["Greenwich", 0, ANGLEUNIT["Degree",
## 0.0174532925199433]], ID["EPSG", 4277]],
## CONVERSION["unnamed", METHOD["Transverse Mercator",
## ID["EPSG", 9807]], PARAMETER["Latitude of natural
## origin", 49, ANGLEUNIT["Degree", 0.0174532925199433],
## ID["EPSG", 8801]], PARAMETER["Longitude of natural
## origin", -2, ANGLEUNIT["Degree", 0.0174532925199433],
## ID["EPSG", 8802]], PARAMETER["Scale factor at natural
## origin", 0.999601, SCALEUNIT["unity", 1], ID["EPSG",
## 8805]], PARAMETER["False easting", 400000,
## LENGTHUNIT["metre", 1], ID["EPSG", 8806]],
## PARAMETER["False northing", -100000,
## LENGTHUNIT["metre", 1], ID["EPSG", 8807]]],
## CS[Cartesian, 2], AXIS["easting", east, ORDER[1],
## LENGTHUNIT["metre", 1, ID["EPSG", 9001]]],
## AXIS["northing", north, ORDER[2], LENGTHUNIT["metre",
## 1, ID["EPSG", 9001]]]]
## Target: OGC:CRS84
## Best instantiable operation has accuracy: 2 m
## Description: Inverse of unnamed + OSGB36 to WGS 84 (6) + axis order change
## (2D)
## Definition: +proj=pipeline +step +inv +proj=tmerc +lat_0=49 +lon_0=-2
## +k=0.999601 +x_0=400000 +y_0=-100000 +ellps=airy
## +step +proj=push +v_3 +step +proj=cart +ellps=airy
## +step +proj=helmert +x=446.448 +y=-125.157
## +z=542.06 +rx=0.15 +ry=0.247 +rz=0.842 +s=-20.489
## +convention=position_vector +step +inv +proj=cart
## +ellps=WGS84 +step +proj=pop +v_3 +step
## +proj=unitconvert +xy_in=rad +xy_out=deg
## Operation 8 is lacking 1 grid with accuracy 1 m
## Missing grid: uk_os_OSTN15_NTv2_OSGBtoETRS.tif
## URL: https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif
The search for candidate coordinate operations may be expedited if the area of interest is provided. This is a vector of minumum longitude and latitude followed by maximum longitude and latitude, so we need to inverse project the bounding box of projected sp objects and coerce that matrix into the required order:
if (is.projected(b_pump)) {
o <- project(t(unclass(bbox(b_pump))), wkt(b_pump), inv=TRUE)
} else {
o <- t(unclass(bbox(b_pump)))
}
## Warning: PROJ support is provided by the sf and terra packages among others
## Warning: PROJ support is provided by the sf and terra packages among others
## [1] -0.23510705 51.41279260 -0.03510705 51.61279260
Without an area of interest, the coordinate operation look-up found 8 candidate operations, with a given area of interest, only 5 are found with the strict_containment=
argument taking its default value of FALSE
, so that the candidates found do not have to contain the area of interest strictly:
nrow(list_coordOps(WKT, "OGC:CRS84", area_of_interest=aoi))
## [1] 5
In this case, changing this argument to TRUE
makes no further difference:
nrow(list_coordOps(WKT, "OGC:CRS84", strict_containment=TRUE, area_of_interest=aoi))
## [1] 5
Methods for projection and transformation use the area of interest implicit in the data object, controlled by an argument: use_aoi
, default TRUE
, for +proj=ob_tran,
FALSE`.
PROJ 7 introduced on-demand downloading of (chunks of) transformation grids from a content delivery network to a user-writable directory on the PROJ search path (usually the first path component). The status of the downloaded grids is stored in another SQLite database, cache.db
. The PR\(\phi\)J user-writable CDN directory is set as soon as the internal search path is queried, and for most uses, the default value will allow all programs using PR\(\phi\)J such as R packages, QGIS, GRASS, etc., to access any downloaded grids. Grids are checked for staleness at regular intervals. This directory may be set to a non-default value with the PROJ_USER_WRITABLE_DIRECTORY
environment variable before rgdal (and any other package using PR\(\phi\)J) is loaded and attached, from PR\(\phi\)J >= 7.1.0. The code used at the beginning of this vignette is repeated here for convenience:
td <- tempfile()
dir.create(td)
Sys.setenv("PROJ_USER_WRITABLE_DIRECTORY"=td)
library(rgdal)
Let us check that rgdal is running with on-demand downloading disabled:
## [1] FALSE
We can enable on-demand download using a function that reports the value of the PR\(\phi\)J CDN user-writable directory; we see that with this setting, network download is enabled:
## [1] "Using: /tmp/RtmpxkDNCG/file10a14268afdae"
## [1] TRUE
The reader will recall that the search path was stored in shpr
above, the first element being the user-writable directory; at this point no SQLite database has been created:
shpr[1]
## [1] "/tmp/RtmpxkDNCG/file10a14268afdae"
## [1] NA
When we then search for candidate coordinate operations, we see that the operation using an absent grid now sees that download is enabled, and proposes the 1m accuracy candidate, because the required grid can be downloaded (again using the area of interest):
list_coordOps(WKT, "OGC:CRS84", area_of_interest=aoi)
## Candidate coordinate operations found: 5
## Search specification: Area of interest: -0.235107046388033, 51.4127925984147, -0.0351070463880326, 51.6127925984147
## Strict containment: FALSE
## Visualization order: TRUE
## Source: PROJCRS["Transverse_Mercator", BASEGEOGCRS["GCS_OSGB 1936",
## DATUM["OSGB_1936", ELLIPSOID["Airy 1830", 6377563.396,
## 299.3249646, LENGTHUNIT["metre", 1]]],
## PRIMEM["Greenwich", 0, ANGLEUNIT["Degree",
## 0.0174532925199433]], ID["EPSG", 4277]],
## CONVERSION["unnamed", METHOD["Transverse Mercator",
## ID["EPSG", 9807]], PARAMETER["Latitude of natural
## origin", 49, ANGLEUNIT["Degree", 0.0174532925199433],
## ID["EPSG", 8801]], PARAMETER["Longitude of natural
## origin", -2, ANGLEUNIT["Degree", 0.0174532925199433],
## ID["EPSG", 8802]], PARAMETER["Scale factor at natural
## origin", 0.999601, SCALEUNIT["unity", 1], ID["EPSG",
## 8805]], PARAMETER["False easting", 400000,
## LENGTHUNIT["metre", 1], ID["EPSG", 8806]],
## PARAMETER["False northing", -100000,
## LENGTHUNIT["metre", 1], ID["EPSG", 8807]]],
## CS[Cartesian, 2], AXIS["easting", east, ORDER[1],
## LENGTHUNIT["metre", 1, ID["EPSG", 9001]]],
## AXIS["northing", north, ORDER[2], LENGTHUNIT["metre",
## 1, ID["EPSG", 9001]]]]
## Target: OGC:CRS84
## Best instantiable operation has accuracy: 1 m
## Description: Inverse of unnamed + OSGB36 to WGS 84 (9) + axis order change
## (2D)
## Definition: +proj=pipeline +step +inv +proj=tmerc +lat_0=49 +lon_0=-2
## +k=0.999601 +x_0=400000 +y_0=-100000 +ellps=airy
## +step +proj=hgridshift
## +grids=uk_os_OSTN15_NTv2_OSGBtoETRS.tif +step
## +proj=unitconvert +xy_in=rad +xy_out=deg
On making the transformation, we may see that the coordinate operation takes longer than expected, because on first pass the grid is downloaded from the network:
system.time(is1m <- spTransform(b_pump, CRS(SRS_string="OGC:CRS84")))
## Warning: PROJ support is provided by the sf and terra packages among others
## user system elapsed
## 0.081 0.010 0.215
The coordinate operation used now specifies the grid in the pipeline:
## [1] "+proj=pipeline +step +inv +proj=tmerc +lat_0=49 +lon_0=-2 +k=0.999601 +x_0=400000 +y_0=-100000 +ellps=airy +step +proj=hgridshift +grids=uk_os_OSTN15_NTv2_OSGBtoETRS.tif +step +proj=unitconvert +xy_in=rad +xy_out=deg"
The coordinate values differ little from the 2m accuracy Helmert pipeline:
print(coordinates(is1m), digits=10)
## coords.x1 coords.x2
## [1,] -0.136687626 51.5133037
as we can see, the 1m accuracy point is 1.7m from the 2m accuracy point, just to the West:
## [1] 1.751474
c(maptools::gzAzimuth(coordinates(is1m), coordinates(is2m)))
## coords.x1
## -95.57299
Now the SQLite grid cache database has been created and has grown in size:
## [1] 319488
If we look in the SQLite database of downloaded grids, we see that the grid components that were downloaded. Here we have not yet used the area of interest to limit the number of chunks involved:
library(RSQLite)
db <- dbConnect(SQLite(), dbname=file.path(shpr[1], "cache.db"))
(tbs <- dbListTables(db))
## [1] "chunk_data" "chunks"
## [3] "downloaded_file_properties" "linked_chunks"
## [5] "linked_chunks_head_tail" "properties"
## [7] "sqlite_sequence"
## id url offset data_id
## 1 1 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 0 1
## 2 2 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2621440 2
## 3 3 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2637824 3
## 4 4 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2654208 4
## 5 5 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2670592 5
## 6 6 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2686976 6
## 7 7 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2703360 7
## 8 8 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2719744 8
## 9 9 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2736128 9
## 10 10 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2752512 10
## 11 11 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2768896 11
## 12 12 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2785280 12
## 13 13 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2801664 13
## 14 14 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2818048 14
## 15 15 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2834432 15
## 16 16 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2850816 16
## data_size
## 1 16384
## 2 16384
## 3 16384
## 4 16384
## 5 16384
## 6 16384
## 7 16384
## 8 16384
## 9 16384
## 10 16384
## 11 16384
## 12 16384
## 13 16384
## 14 16384
## 15 16384
## 16 16384
dbDisconnect(db)
Finally, we disable grid download to return to the status existing when rgdal was attached; the cached grids in this case, using an R temporary directory, will be discarded, but in usual workflows, grids are downloaded once, used often and updated seldom when the server versions change.
## [1] FALSE
The outcome positions are shown here; at zoom 18 we can see that the 1m accuracy green point matches the Open Street Map location of the pump very well:
library(mapview)
x <- mapview(is2m, map.type="OpenStreetMap", legend=FALSE) + mapview(is1m, col.regions="green", legend=FALSE) + mapview(isballpark, col.regions="red", legend=FALSE)
mapshot(x, file="snow.png")
knitr::include_graphics(system.file("misc/snow.png", package="rgdal"))
Package authors of the almost 150 packages with sp objects with outdated "CRS"
objects stored in *.rda
or *RData
files as data sets should update the "proj4string"
slots of these objects and re-store. The GGHB.IG
data set in the CARBayesdata package (version 2.1) can serve as an example.
# library(CARBayesdata)
library(sp)
# data(GGHB.IG)
# orig <- slot(GGHB.IG, "proj4string")
(load(system.file("misc/GGHB.IG_CRS.rda", package="rgdal")))
## [1] "orig"
orig
## Coordinate Reference System:
## Deprecated Proj.4 representation:
## +proj=tmerc +lat_0=49 +lon_0=-2 +k=0.9996012717 +x_0=400000
## +y_0=-100000 +datum=OSGB36 +units=m +no_defs +ellps=airy
## +towgs84=446.448,-125.157,542.060,0.1502,0.2470,0.8421,-20.4894
The original "CRS"
object contains a +datum=
key (deprecated in GDAL’s exportToProj4()
from GDAL 3) and a +towgs84=
key, the handling of which has varied in different releases of PROJ >= 6 and GDAL 3. From rgdal 1.5-17 (development version), a PROJ function proj_get_source_crs
is used by default, but may be avoided by setting the sp::CRS()
get_source_if_boundcrs=
argument to FALSE. The function will return the source CRS of a BOUNDCRS
object. In some versions of PROJ/GDAL, the inclusion of a +towgs84=
key is taken as meaning that the user wishes to create an object with the given source CRS, but because a +towgs84=
key is present, it is assumed that the target is WGS 84
with the given transformation method. Proj4 strings with +towgs84=
keys have no authority for the parameters given, and do not harmonise with the use of the EPSG database.
To run the examples, we need the current development version of sp from "rsbivand/sp"
:
Running sp::CRS()
takes the original Proj4 key-value pairs, and converts them into a WKT2 representation as well as returning an (unused) Proj4 string. This is kept in place for packages still expecting to see/use it, but all rgdal functions and methods use the WKT2 representation. Here, we do not try to handle the BOUNDCRS
special case, something that leads to difficulties with some versions of PROJ later in workflows. The BOUNDCRS
contains a SOURCECRS
and a TARGETCRS
(which also has swapped axes), while really all that was needed was the contents of the SOURCECRS
:
## PROJCRS["unknown",
## BASEGEOGCRS["unknown",
## DATUM["OSGB 1936",
## ELLIPSOID["Airy 1830",6377563.396,299.3249646,
## LENGTHUNIT["metre",1]],
## ID["EPSG",6277]],
## PRIMEM["Greenwich",0,
## ANGLEUNIT["degree",0.0174532925199433],
## ID["EPSG",8901]]],
## CONVERSION["unknown",
## METHOD["Transverse Mercator",
## ID["EPSG",9807]],
## PARAMETER["Latitude of natural origin",49,
## ANGLEUNIT["degree",0.0174532925199433],
## ID["EPSG",8801]],
## PARAMETER["Longitude of natural origin",-2,
## ANGLEUNIT["degree",0.0174532925199433],
## ID["EPSG",8802]],
## PARAMETER["Scale factor at natural origin",0.9996012717,
## SCALEUNIT["unity",1],
## ID["EPSG",8805]],
## PARAMETER["False easting",400000,
## LENGTHUNIT["metre",1],
## ID["EPSG",8806]],
## PARAMETER["False northing",-100000,
## LENGTHUNIT["metre",1],
## ID["EPSG",8807]]],
## CS[Cartesian,2],
## AXIS["(E)",east,
## ORDER[1],
## LENGTHUNIT["metre",1,
## ID["EPSG",9001]]],
## AXIS["(N)",north,
## ORDER[2],
## LENGTHUNIT["metre",1,
## ID["EPSG",9001]]]]
Using the default setting to remove the unintended BOUNDCRS
representation, we can get what was (probably) intended (when the sp version is as was before this work-around was added, the following two chunks yield BOUNDCRS
for some versions of PROJ/GDAL:
## PROJCRS["unknown",
## BASEGEOGCRS["unknown",
## DATUM["OSGB 1936",
## ELLIPSOID["Airy 1830",6377563.396,299.3249646,
## LENGTHUNIT["metre",1]],
## ID["EPSG",6277]],
## PRIMEM["Greenwich",0,
## ANGLEUNIT["degree",0.0174532925199433],
## ID["EPSG",8901]]],
## CONVERSION["unknown",
## METHOD["Transverse Mercator",
## ID["EPSG",9807]],
## PARAMETER["Latitude of natural origin",49,
## ANGLEUNIT["degree",0.0174532925199433],
## ID["EPSG",8801]],
## PARAMETER["Longitude of natural origin",-2,
## ANGLEUNIT["degree",0.0174532925199433],
## ID["EPSG",8802]],
## PARAMETER["Scale factor at natural origin",0.9996012717,
## SCALEUNIT["unity",1],
## ID["EPSG",8805]],
## PARAMETER["False easting",400000,
## LENGTHUNIT["metre",1],
## ID["EPSG",8806]],
## PARAMETER["False northing",-100000,
## LENGTHUNIT["metre",1],
## ID["EPSG",8807]]],
## CS[Cartesian,2],
## AXIS["(E)",east,
## ORDER[1],
## LENGTHUNIT["metre",1,
## ID["EPSG",9001]]],
## AXIS["(N)",north,
## ORDER[2],
## LENGTHUNIT["metre",1,
## ID["EPSG",9001]]]]
The sp::rebuild_CRS()
method for "CRS"
objects (and thus that for "Spatial"
objects) can be used to do the same as using sp::CRS()
directly, with special treatment for a number of other objects inheriting from "Spatial"
:
orig1b <- rebuild_CRS(orig)
cat(wkt(orig1b), "\n")
## PROJCRS["unknown",
## BASEGEOGCRS["unknown",
## DATUM["OSGB 1936",
## ELLIPSOID["Airy 1830",6377563.396,299.3249646,
## LENGTHUNIT["metre",1]],
## ID["EPSG",6277]],
## PRIMEM["Greenwich",0,
## ANGLEUNIT["degree",0.0174532925199433],
## ID["EPSG",8901]]],
## CONVERSION["unknown",
## METHOD["Transverse Mercator",
## ID["EPSG",9807]],
## PARAMETER["Latitude of natural origin",49,
## ANGLEUNIT["degree",0.0174532925199433],
## ID["EPSG",8801]],
## PARAMETER["Longitude of natural origin",-2,
## ANGLEUNIT["degree",0.0174532925199433],
## ID["EPSG",8802]],
## PARAMETER["Scale factor at natural origin",0.9996012717,
## SCALEUNIT["unity",1],
## ID["EPSG",8805]],
## PARAMETER["False easting",400000,
## LENGTHUNIT["metre",1],
## ID["EPSG",8806]],
## PARAMETER["False northing",-100000,
## LENGTHUNIT["metre",1],
## ID["EPSG",8807]]],
## CS[Cartesian,2],
## AXIS["(E)",east,
## ORDER[1],
## LENGTHUNIT["metre",1,
## ID["EPSG",9001]]],
## AXIS["(N)",north,
## ORDER[2],
## LENGTHUNIT["metre",1,
## ID["EPSG",9001]]]]
In practice, sp::rebuild_CRS()
methods may be used, but users should check the output for coherence, and the PROJ function for extracting SOURCECRS
from BOUNDCRS
avoided if it has unfortunate consequences.